Optimal Algorithms for Testing Closeness of Discrete Distributions | Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics

نویسندگان

  • Siu-On Chan
  • Ilias Diakonikolas
  • Gregory Valiant
چکیده

We study the question of closeness testing for two discrete distributions. More precisely, given samples from two distributions p and q over an n-element set, we wish to distinguish whether p = q versus p is at least ε-far from q, in either `1 or `2 distance. Batu et al [BFR00, BFR13] gave the first sub-linear time algorithms for these problems, which matched the lower bounds of [Val11] up to a logarithmic factor in n, and a polynomial factor of ε. In this work, we present simple testers for both the `1 and `2 settings, with sample complexity that is information-theoretically optimal, to constant factors, both in the dependence on n, and the dependence on ε; for the `1 testing problem we establish that the sample complexity is Θ(max{n/ε, n/ε}).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fair Scheduling via Iterative Quasi-Uniform Sampling | Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics

In the paper we consider minimizing the `k-norms of flow time on a single machine offline using a preemptive scheduler for k ≥ 1. We show the first O(1)approximation for the problem, improving upon the previous best O(log logP )-approximation by Bansal and Pruhs (FOCS 09 and SICOMP 14) where P is the ratio of the maximum job size to the minimum. Our main technical ingredient is a novel combinat...

متن کامل

An Efficient Algorithm for Computing High-Quality Paths amid Polygonal Obstacles | Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics

We study a path-planning problem amid a set O of obstacles in R, in which we wish to compute a short path between two points while also maintaining a high clearance from O; the clearance of a point is its distance from a nearest obstacle in O. Specifically, the problem asks for a path minimizing the reciprocal of the clearance integrated over the length of the path. We present the first polynom...

متن کامل

Combinatorial Auctions via Posted Prices | Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics

We study anonymous posted price mechanisms for combinatorial auctions in a Bayesian framework. In a posted price mechanism, item prices are posted, then the consumers approach the seller sequentially in an arbitrary order, each purchasing her favorite bundle from among the unsold items at the posted prices. These mechanisms are simple, transparent and trivially dominant strategy incentive compa...

متن کامل

On Rationality of Nonnegative Matrix Factorization | Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms | Society for Industrial and Applied Mathematics

Nonnegative matrix factorization (NMF) is the problem of decomposing a given nonnegative n × m matrix M into a product of a nonnegative n×d matrix W and a nonnegative d×m matrix H. NMF has a wide variety of applications, including bioinformatics, chemometrics, communication complexity, machine learning, polyhedral combinatorics, among many others. A longstanding open question, posed by Cohen an...

متن کامل

Testing Soccer League Competition Algorithm in Comparison with Ten Popular Meta-heuristic Algorithms for Sizing Optimization of Truss Structures

Recently, many meta-heuristic algorithms are proposed for optimization of various problems. Some of them originally are presented for continuous optimization problems and some others are just applicable for discrete ones. In the literature, sizing optimization of truss structures is one of the discrete optimization problems which is solved by many meta-heuristic algorithms. In this paper, in or...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013